PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID EMT03247
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Aegilops
Family HD-ZIP
Protein Properties Length: 829aa    MW: 89861.6 Da    PI: 7.7853
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
EMT03247genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox56.35.4e-1868118656
               S--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
  Homeobox   6 tftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
               ++t+ q++eL++lFe++++p++++r+ L k+++L+ +qVk+WFqNrR++ k
  EMT03247  68 RHTPRQIQELQALFEQCPHPDEKQRAALGKRVRLQPSQVKFWFQNRRTQLK 118
               789********************************************9876 PP

2START1511.1e-472935382203
               HHHHHHHHHHHHHHHC-TT-EEEE........EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHH.HHHHHHCCCGG..............CT-T CS
     START   2 laeeaaqelvkkalaeepgWvkss........esengdevlqkfeeskv.....dsgealrasgvvdmvla.llveellddke..............qWde 74 
               la +a++elvk+a+ +ep+W   +        e +n +++l++f+++ +     + +ea+r+sg+v  +   +lve+lld  +              +W++
  EMT03247 293 LATSAMDELVKMAQMDEPLWIPNVplpgslakETLNHEVYLRTFSPAIGvkppgFVSEASRESGIVSSDGSvELVETLLDEVAlislcppfsllfprRWSR 393
               7889*****************999999*******************999****************98774449**************************** PP

               T-S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--......-TTSEE-EESSEEEEEEEECT CS
     START  75 tla....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.....sssvvRaellpSgiliepksn 159
                +     + +t+e is+g      gal lm+a+lq+lsplvp R++ f+R+++qlg+g w++vdvS d   + +      +   +++++lpSg+++++++ 
  EMT03247 394 FFSciiaESSTTEEISTGaagnrhGALLLMQAKLQVLSPLVPvREVAFLRFCKQLGEGLWAVVDVSMDGLGMEQGlavasTTANMKCRRLPSGCVVQDTPS 494
               *******************************************************************988877766666688899**************** PP

               CEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXX CS
     START 160 ghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrq 203
               g +kvtwveh+ ++++++h+l+r+l++sgla ga +w+atlqrq
  EMT03247 495 GFCKVTWVEHTVYDESSVHQLYRPLLRSGLALGAGRWLATLQRQ 538
               ******************************************98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.13E-1752118IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.608.5E-1854118IPR009057Homeodomain-like
SMARTSM003891.2E-1559124IPR001356Homeobox domain
PROSITE profilePS5007115.67760120IPR001356Homeobox domain
CDDcd000862.94E-1660118No hitNo description
PfamPF000461.7E-1568118IPR001356Homeobox domain
PROSITE profilePS5084839.452283544IPR002913START domain
SuperFamilySSF559611.65E-19286539No hitNo description
CDDcd088751.11E-96287538No hitNo description
SMARTSM002345.4E-29292541IPR002913START domain
PfamPF018525.9E-38293538IPR002913START domain
SuperFamilySSF559611.02E-12560669No hitNo description
SuperFamilySSF559611.02E-12703791No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 829 aa     Download sequence    Send to blast
MAKSFDPLKF VLPWGLHAAG VVKAEDGSRP MTRHLHAVSG GDAGGGEDDE VAPEPGRHGK  60
RKRPPNARHT PRQIQELQAL FEQCPHPDEK QRAALGKRVR LQPSQVKFWF QNRRTQLKVI  120
NQSDRALQHS RSFTRALTTR LIPLCGGSSC KRESLMHENQ LLMAENGKLR AETLSLRVEM  180
AHPVCGGCGG PVPAMLDESP SLEEDLRVQN ATLKNELSRL CALASAFLGR SISFVAAPPT  240
PGSPVERIGS MPSATVANST VTEFTGSPSS QTVGMAITPM AKAGIDRSVF LELATSAMDE  300
LVKMAQMDEP LWIPNVPLPG SLAKETLNHE VYLRTFSPAI GVKPPGFVSE ASRESGIVSS  360
DGSVELVETL LDEVALISLC PPFSLLFPRR WSRFFSCIIA ESSTTEEIST GAAGNRHGAL  420
LLMQAKLQVL SPLVPVREVA FLRFCKQLGE GLWAVVDVSM DGLGMEQGLA VASTTANMKC  480
RRLPSGCVVQ DTPSGFCKVT WVEHTVYDES SVHQLYRPLL RSGLALGAGR WLATLQRQYD  540
CLDVLMPKQD SSDAMLEGSR SLLRLAERMM DNFCAGVSAS SAEWSKLDDV TGSIGEDVRI  600
MARRSVDEPG VPAGVVLCAA TSVWMLVTPK RLFNFLCNEK TRAEWDILSK GGPMQEVTKI  660
YKGQQNGNAV SLLKATRRGV ILLFCKKKLI LSMMTNEACE VEVIFLRKNA PNTHQDSSIL  720
ILQETCTDAS GSMVVYTPVD IPAMRRVMGG EGDPASIMLL PSGFAILPGG PSMSGDDGHK  780
TCGSLLTVAF QILGNRQPTG KLTVESVQTV DSLISCTINR IKTALRCDA
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_003581542.10.0PREDICTED: homeobox-leucine zipper protein ROC4
SwissprotQ7Y0V90.0ROC4_ORYSJ; Homeobox-leucine zipper protein ROC4
TrEMBLN1QQP70.0N1QQP7_AEGTA; Homeobox-leucine zipper protein ROC4
STRINGBRADI5G18717.10.0(Brachypodium distachyon)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP120337116
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G61150.10.0homeodomain GLABROUS 1